33. (Currently amended) A system for automatically cataloguing documents located in multiple
heterogeneous repositories, the system comprising: 

a scanning tool for scanning the multiple heterogeneous repositories to collect keywords 
for the documents located therein; 

a keyword index to the documents built using the collected keywords; 

a mapping tool for mapping the documents using the keyword index to one or more 
classes, each of the one or more classes including keywords representative of that class; and 

a computing device for creating metadata indicative of each of the documents and 
cataloguing each of the documents in an integrated library according to the metadata in a 
meta-index, wherein the metadata for each of the documents indexed within the meta-index is stored in 
a pre-defined data structure including at least one of the following attributes a uniform resource 
locator (URL), a title, an author, an abstract, a collection, a keyword, one or more matched 
words, a path, a classmark, a classification date and a last modified date, further wherein the 
meta-index retains characteristics of each of the multiple heterogeneous repositories as applied to 
each of the documents such that a user may access one or more of the documents within the 
multiple heterogeneous repositories utilizing the meta-index, and further wherein the 
characteristics of the multiple heterogeneous repositories are transparent to the user when one or 
more of the documents are accessed using the meta-index. 



34. (Cancelled) 

35. (Cancelled) 

36. (Previously presented) The system according to claim 33, wherein the metadata is stored in 
eXtensible Markup Language (XML) format. 

37. (Previously presented) The system according to claim 33, wherein the metadata is stored in 
Resource Description Framework (RDF) format. 

38. (Previously presented) The system according to claim 33, wherein the scanning tool is at 
least one spider. 

39. (Previously presented) The system according to claim 33, wherein the mapping tool is a 
domain ontology. 

40. (Previously presented) The system according to claim 39, wherein the domain ontology is a 
classification hierarchy. 

41. (Previously presented) The system according to claim 33, wherein the mapping tool is a 
neural network. 

42. (Currently amended) A method for automatically cataloguing documents located in multiple 
heterogeneous repositories, comprising: 

scanning the multiple heterogeneous repositories to collect keywords from the documents 
located therein; 

building a keyword index to the documents stored in the multiple heterogeneous 
repositories using the collected keywords; 

mapping the documents using the keyword index into predetermined classes, wherein the 
mapping is performed using at least one mapping tool; 

creating metadata information, including identification of the predetermined class, for the 
documents; and 

cataloguing each of the documents in an integrated library according to the metadata in a 
meta-index, wherein the metadata for each of the documents indexed within the meta-index is 
stored in a pre-defined data structure including at least one of the following attributes a universal 
resource locator, a title, an author, an abstract, a collection, a keyword, one or more matched 
words, a path, a classmark, a classification date and a last modified date, and further wherein the 
meta-index retains the characteristics of each of the multiple heterogeneous repositories as 
applied to each of the documents such that a user may access one or more of the documents 
within the multiple heterogeneous repositories utilizing the meta-index, and wherein the 
characteristics of the multiple heterogeneous repositories are transparent to the user when one or 
more of the documents are accessed using the meta-index. 

43. (Previously presented) The method of claim 42, wherein scanning the at least one 
information repository to collect keywords is performed by a spider. 

44. (Previously presented) The method of claim 42, wherein the metadata information is stored 
in the eXtensible Markup Language (XML) format. 

45. (Previously presented) The method of claim 42, wherein the metadata information is stored 
in the Resource Description Framework (RDF) format. 

46. (Currently amended) A method for automatically cataloguing documents located on at least a 
first and second website, comprising: 

scanning the at least a first and second website to collect keywords from the documents 
located therein, wherein documents located on a first website are in a first format and documents 
located on a second website are in a second format; 

building a keyword index to the documents stored on the at least a first and second 
website using the collected keywords; 

mapping the documents using the keyword index into predetermined classes, wherein the 
mapping is performed using at least one mapping tool; 

creating metadata information, including identification of the predetermined class, for the 
documents; and 

cataloguing each of the documents in an integrated library according to the metadata in a 
meta-index, wherein the metadata for each of the documents indexed within the meta-index is 
stored in a third format and further wherein the meta-index retains the first format and the second 
format, respectively, for the documents in each of the at least a first and second websites such 
that a user may access one or more of the documents within the at least a first and second 
website utilizing the meta-index, and further wherein the characteristics of the at least a first and 
second websites are transparent to the user when one or more of the documents are accessed 
using the meta-index. 

47. (Previously presented) The method of claim 46, wherein scanning the at least a first and 
second website to collect keywords is performed by a spider. 

48. (Previously presented) The method of claim 46, wherein the metadata is stored in the 
eXtensible Markup Language (XML) format. 

49. (Previously presented) The method of claim 46, wherein the metadata is stored in the 
Resource Description Framework (RDF) format. 